Prediction of transcriptional regulatory sites in the complete genome sequence of Escherichia coli K-12
نویسندگان
چکیده
MOTIVATION As one of the best-characterized free-living organisms, Escherichia coli and its recently completed genomic sequence offer a special opportunity to exploit systematically the variety of regulatory data available in the literature in order to make a comprehensive set of regulatory predictions in the whole genome. RESULTS The complete genome sequence of E.coli was analyzed for the binding of transcriptional regulators upstream of coding sequences. The biological information contained in RegulonDB (Huerta, A.M. et al., Nucleic Acids Res.,26,55-60, 1998) for 56 different transcriptional proteins was the support to implement a stringent strategy combining string search and weight matrices. We estimate that our search included representatives of 15-25% of the total number of regulatory binding proteins in E.coli. This search was performed on the set of 4288 putative regulatory regions, each 450 bp long. Within the regions with predicted sites, 89% are regulated by one protein and 81% involve only one site. These numbers are reasonably consistent with the distribution of experimental regulatory sites. Regulatory sites are found in 603 regions corresponding to 16% of operon regions and 10% of intra-operonic regions. Additional evidence gives stronger support to some of these predictions, including the position of the site, biological consistency with the function of the downstream gene, as well as genetic evidence for the regulatory interaction. The predictions described here were incorporated into the map presented in the paper describing the complete E.coli genome (Blattner,F.R. et al., Science, 277, 1453-1461, 1997). AVAILABILITY The complete set of predictions in GenBank format is available at the url: http://www. cifn.unam.mx/Computational_Biology/E.coli-predictions CONTACT [email protected], [email protected]
منابع مشابه
Microbial computational genomics of gene regulation*
Escherichia coli is a free-living bacterium that condensates a large legacy of knowledge as a result of years of experimental work in molecular biology. It represents a point of departure for analyses and comparisons with the ever-increasing number of finished microbial genomes. For years, we have been gathering knowledge from the literature on transcriptional regulation and operon organization...
متن کاملDecoding genome-wide GadEWX-transcriptional regulatory networks reveals multifaceted cellular responses to acid stress in Escherichia coli
The regulators GadE, GadW and GadX (which we refer to as GadEWX) play a critical role in the transcriptional regulation of the glutamate-dependent acid resistance (GDAR) system in Escherichia coli K-12 MG1655. However, the genome-wide regulatory role of GadEWX is still unknown. Here we comprehensively reconstruct the genome-wide GadEWX transcriptional regulatory network and RpoS involvement in ...
متن کاملAcquired Antimicrobial Resistance Genes of Escherichia coli Obtained from Nigeria: In silico Genome Analysis
Background: Antimicrobial resistance is a global problem with enormous public health and economic impact. This study was carried out to get an overview of acquired antimicrobial resistance gene sequences in the genomes of Escherichia coli isolated from different food sources and the environment in Nigeria. Methods: To determine the acquired antimicrobial-resistant genes prevalence, genome asse...
متن کاملFinished Genome Sequence of the Laboratory Strain Escherichia coli K-12 RV308 (ATCC 31608)
Escherichia coli strain K-12 substrain RV308 is an engineered descendant of the K-12 wild-type strain. Like its ancestor, it is an important organism in biotechnological research and is heavily used for the expression of single-chain variable fragments. Here, we report the complete genome sequence of E. coli K-12 RV308 (ATCC 31608).
متن کاملFinished Genome Sequence of Escherichia coli K-12 Strain HMS174 (ATCC 47011)
Escherichia coli strain K-12 substrain HMS174 is an engineered descendant of the E. coli K-12 wild-type strain. Like its ancestor, it is an important organism in biotechnological research and is used in fermentation processes for heterologous protein production. Here, we report the complete genome sequence of E. coli HMS174 (ATCC 47011).
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 14 5 شماره
صفحات -
تاریخ انتشار 1997